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SYSTEM AND METHOD OF MANAGING AND MONITORING CLUSTER 

RESOURCES 

BACKGROUND OF THE INVENTION 

1. Field of the Invention 

[0001] The present invention relates to a resource management system and more 
specifically to a system and method of managing and monitoring cluster resources. 

2. Introduction 

[0002] Managers of clusters desire maximum return on investment often meaning high 
system utilization and the ability to deliver various qualities of service to various users 
and groups. A cluster is typically defined as a parallel computer that is constructed of 
commodity components and runs as its system software commodity software. A cluster 
contains nodes each containing one or more processors, memory that is shared by all of 
the processors in the respective node and additional peripheral devices such as storage 
disks that are connected by a network that allows data to move between nodes. 
[0003] The managers of such clusters need to understand how the available resources 
are being delivered to the various users over time and need the ability to have the 
administrators tune 'cycle delivery* to satisfy the current site mission objectives. 
[0004] How well a scheduler succeeds can only be determined if various metrics are 
established and a means to measure these metrics are available. While statistics are 
important, their value is limited unless optimal statistical values are also known for the 
current environment including workload, resources, and policies. If one could determine 
that a site's typical workload obtained an average queue time of 3 hours on a particular 
system, this would be a good statistic. However, if one knew that through proper tuning, 
the system could deliver an average queue time of 1.2 hours with minimal negative side 
effects, this would be valuable knowledge. 
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[0005] The present invention was developed to address these issues. At its core, it is a 
tool designed to truly manage cluster resources and provide meaningful information 
about what is actually happening on the system. It was created to satisfy real-world 
needs of a batch system administrator as he or she tries to balance the needs of users, 
staff, and managers. 

BRIEF DESCRIPTION OF THE DRAWINGS 

[0006] In order to describe the manner in which the above-recited and other advantages 
and features of the invention can be obtained, a more particular description of the 
invention briefly described above will be rendered by reference to specific embodiments 
thereof which are illustrated in the appended documents and drawings. Understanding 
that these drawings depict only typical embodiments of the invention and are not 
therefore to be considered to be limiting of its scope, the invention will be described and 
explained with additional specificity and detail through the use of the accompanying 
drawings. These drawings are found in the various documents found in the attached 
Appendices and will be referred to and explained in the respective document which 
includes the drawing. 

DETAILED DESCRIPTION OF THE INVENTION 

[0007] The details of the present invention will be understood with reference to the 
associated documents attached as Appendix A hereto and further includes a CD 
according to 37 C.F.R. 1.54(e) and 1.96. There are two copies of the CD (Copy 1 and 
Copy 2). Each copy contains the same identical set of documents. The enclosed CD 
Listing of Documents will set forth the documents and folders on the CD with an 
accompanying explanation of the subject matter of each document. Each document 
contained on the CDs is incorporated herein by reference into this patent application. 
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[0008] Embodiments within the scope of the present invention may also include 
computer-readable media for carrying or having computer-executable instructions or data 
structures stored thereon. Such computer-readable media can be any available media that 
can be accessed by a general purpose or special purpose computer. By way of example, 
and not limitation, such computer-readable media can comprise RAM, ROM, EEPROM, 
CD-ROM or other optical disk storage, magnetic disk storage or other magnetic storage 
devices, or any other medium which can be used to carry or store desired program code 
means in the form of computer-executable instructions or data structures. When 
information is transferred or provided over a network or another communications 
connection (either hardwired, wireless, or combination thereof) to a computer, the 
computer properly views the connection as a computer-readable medium. Thus, any 
such connection is properly termed a computer-readable medium. Combinations of the 
above should also be included within the scope of the computer-readable media. 
[0009] Computer-executable instructions include, for example, instructions and data 
which cause a general purpose computer, special purpose computer, or special purpose 
processing device to perform a certain function or group of functions. Computer- 
executable instructions also include program modules that are executed by computers in 
stand-alone or network environments. Generally, program modules include routines, 
programs, objects, components, and data structures, etc. that perform particular tasks or 
implement particular abstract data types. Computer-executable instructions, associated 
data structures, and program modules represent examples of the program code means 
for executing steps of the methods disclosed herein. The particular sequence of such 
executable instructions or associated data structures represents examples of 
corresponding acts for implementing the functions described in such steps. 
[0010] Those of skill in the art will appreciate that other embodiments of the invention 
may be practiced in network computing environments with many types of computer 
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system configurations, including personal computers, hand-held devices, multi-processor 
systems, microprocessor-based or programmable consumer electronics, network PCs, 
minicomputers, mainframe computers, and the like. Embodiments may also be practiced 
in distributed computing environments where tasks are performed by local and remote 
processing devices that are linked (either by hardwired links, wireless links, or by a 
combination thereof) through a communications network. In a distributed computing 
environment, program modules may be located in both local and remote memory storage 
devices. 
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